Goto

Collaborating Authors

 Niigata Prefecture


Tokyo rally urges return of all Japanese abductees held in North Korea

The Japan Times

Sakie Yokota (center, back), mother of North Korean abductee Megumi Yokota, and others attend a rally held in Tokyo on Saturday that called for the immediate return of Japanese people abducted by North Korea. A large-scale rally was held in Tokyo on Saturday to seek the immediate return home of all Japanese abductees in North Korea. Relatives of those abducted to North Korea decades ago expressed hopes for the return of abductees immediately and while their parents are still alive. The event, organized by the association of families of abduction victims and other entities, was attended by about 800 people, including Prime Minister Sanae Takaichi. "We will never give up," said Takuya Yokota, 57, head of the association and the younger brother of Megumi Yokota, who was abducted in 1977 at the age of 13. He called on North Korean leader Kim Jong Un to release all abductees to "chart a bright future for both countries."


TEPCO reports error at Kashiwazaki-Kariwa nuclear plant

The Japan Times

Tokyo Electric Power Company Holdings (Tepco) said Saturday that an alert system did not work during a test operation held the day prior as part of the restart of the No. 6 reactor at its Kashiwazaki-Kariwa nuclear plant in Niigata Prefecture. The company is working to identify the cause of the problem, but failure to resolve it soon may affect its plan to restart the reactor on Tuesday. According to Tepco, the problem was confirmed at 12:36 p.m., and it stopped the test operation. The alert system is designed to activate when a control rod is being pulled out of the reactor while another rod is already out. The Kashiwazaki-Kariwa reactor would be the first of Tepco's nuclear reactors to be restarted since the March 2011 accident at its tsunami-crippled Fukushima No. 1 nuclear plant.


Toward an Insider Threat Education Platform: A Theoretical Literature Review

arXiv.org Artificial Intelligence

Insider threats (InTs) within organizations are small in number but have a disproportionate ability to damage systems, information, and infrastructure. Existing InT research studies the problem from psychological, technical, and educational perspectives. Proposed theories include research on psychological indicators, machine learning, user behavioral log analysis, and educational methods to teach employees recognition and mitigation techniques. Because InTs are a human problem, training methods that address InT detection from a behavioral perspective are critical. While numerous technological and psychological theories exist on detection, prevention, and mitigation, few training methods prioritize psychological indicators. This literature review studied peer-reviewed, InT research organized by subtopic and extracted critical theories from psychological, technical, and educational disciplines. In doing so, this is the first study to comprehensively organize research across all three approaches in a manner which properly informs the development of an InT education platform.


Compression Method for Solar Polarization Spectra Collected from Hinode SOT/SP Observations

arXiv.org Artificial Intelligence

The complex structure and extensive details of solar spectral data, combined with a recent surge in volume, present significant processing challenges. To address this, we propose a deep learning-based compression technique using deep autoencoder (DAE) and 1D-convolutional autoencoder (CAE) models developed with Hinode SOT/SP data. We focused on compressing Stokes I and V polarization spectra from the quiet Sun, as well as from active regions, providing a novel insight into comprehensive spectral analysis by incorporating spectra from extreme magnetic fields. The results indicate that the CAE model outperforms the DAE model in reconstructing Stokes profiles, demonstrating greater robustness and achieving reconstruction errors around the observational noise level. The proposed method has proven effective in compressing Stokes I and V spectra from both the quiet Sun and active regions, highlighting its potential for impactful applications in solar spectral analysis, such as detection of unusual spectral signals.


Wireless Link Quality Estimation Using LSTM Model

arXiv.org Artificial Intelligence

In recent years, various services have been provided through high-speed and high-capacity wireless networks on mobile communication devices, necessitating stable communication regardless of indoor or outdoor environments. To achieve stable communication, it is essential to implement proactive measures, such as switching to an alternative path and ensuring data buffering before the communication quality becomes unstable. The technology of Wireless Link Quality Estimation (WLQE), which predicts the communication quality of wireless networks in advance, plays a crucial role in this context. In this paper, we propose a novel WLQE model for estimating the communication quality of wireless networks by leveraging sequential information. Our proposed method is based on Long Short-Term Memory (LSTM), enabling highly accurate estimation by considering the sequential information of link quality. We conducted a comparative evaluation with the conventional model, stacked autoencoder-based link quality estimator (LQE-SAE), using a dataset recorded in real-world environmental conditions. Our LSTM-based LQE model demonstrates its superiority, achieving a 4.0% higher accuracy and a 4.6% higher macro-F1 score than the LQE-SAE model in the evaluation.


MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines

arXiv.org Artificial Intelligence

The advent of Large Language Models (LLMs) has paved the way for AI search engines, e.g., SearchGPT, showcasing a new paradigm in human-internet interaction. However, most current AI search engines are limited to text-only settings, neglecting the multimodal user queries and the text-image interleaved nature of website information. Recently, Large Multimodal Models (LMMs) have made impressive strides. Yet, whether they can function as AI search engines remains under-explored, leaving the potential of LMMs in multimodal search an open question. To this end, we first design a delicate pipeline, MMSearch-Engine, to empower any LMMs with multimodal search capabilities. On top of this, we introduce MMSearch, a comprehensive evaluation benchmark to assess the multimodal search performance of LMMs. The curated dataset contains 300 manually collected instances spanning 14 subfields, which involves no overlap with the current LMMs' training data, ensuring the correct answer can only be obtained within searching. By using MMSearch-Engine, the LMMs are evaluated by performing three individual tasks (requery, rerank, and summarization), and one challenging end-to-end task with a complete searching process. We conduct extensive experiments on closed-source and open-source LMMs. Among all tested models, GPT-4o with MMSearch-Engine achieves the best results, which surpasses the commercial product, Perplexity Pro, in the end-to-end task, demonstrating the effectiveness of our proposed pipeline. We further present error analysis to unveil current LMMs still struggle to fully grasp the multimodal search tasks, and conduct ablation study to indicate the potential of scaling test-time computation for AI search engine. We hope MMSearch may provide unique insights to guide the future development of multimodal AI search engine. Project Page: https://mmsearch.github.io


Japan will test self-driving bullet trains in 2028

Popular Science

One of Japan's largest railway companies wants fully self-driving bullet trains speeding through the country by the mid-2030's. According to East Japan Railway (JR East), at least one prototype will debut in 2028. The company revealed its plans on September 10, citing hopes to both streamline its operations and make them more sustainable. Japan's iconic Shinkansen railines, more commonly known as bullet trains, have been a staple of the nation's high-speed public transportation routes for over half a century. Traveling as fast as 300 km per hour (roughly 186 mph), the trains weave throughout the country's major urban areas, and are now completely electric as well as more lightweight than earlier models. According to The Japan Times, self-driving bullet trains have been a part of JR East's overall plans since at least 2018, when the company presented its "Transformation 2027" project framework.


Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data

arXiv.org Artificial Intelligence

Discrete diffusion models with absorbing processes have shown promise in language modeling. The key quantities to be estimated are the ratios between the marginal probabilities of two transitive states at all timesteps, called the concrete score. In this paper, we reveal that the concrete score in absorbing diffusion can be expressed as conditional probabilities of clean data, multiplied by a time-dependent scalar in an analytic form. Motivated by this finding, we propose reparameterized absorbing discrete diffusion (RADD), a dedicated diffusion model without time-condition that characterizes the time-independent conditional probabilities. Besides its simplicity, RADD can reduce the number of function evaluations (NFEs) by caching the output of the time-independent network when the noisy sample remains unchanged in a sampling interval. Empirically, RADD is up to 3.5 times faster while achieving similar performance with the strongest baseline. Built upon the new perspective of conditional distributions, we further unify absorbing discrete diffusion and any-order autoregressive models (AO-ARMs), showing that the upper bound on the negative log-likelihood for the diffusion model can be interpreted as an expected negative log-likelihood for AO-ARMs. Further, our RADD models achieve SOTA performance among diffusion models on 5 zero-shot language modeling benchmarks (measured by perplexity) at the GPT-2 scale. Our code is available at https://github.com/ML-GSAI/RADD.


Deciphering interventional dynamical causality from non-intervention systems

arXiv.org Machine Learning

Detecting and quantifying causality is a focal topic in the fields of science, engineering, and interdisciplinary studies. However, causal studies on non-intervention systems attract much attention but remain extremely challenging. To address this challenge, we propose a framework named Interventional Dynamical Causality (IntDC) for such non-intervention systems, along with its computational criterion, Interventional Embedding Entropy (IEE), to quantify causality. The IEE criterion theoretically and numerically enables the deciphering of IntDC solely from observational (non-interventional) time-series data, without requiring any knowledge of dynamical models or real interventions in the considered system. Demonstrations of performance showed the accuracy and robustness of IEE on benchmark simulated systems as well as real-world systems, including the neural connectomes of C. elegans, COVID-19 transmission networks in Japan, and regulatory networks surrounding key circadian genes.


Uncertainty Management in the Construction of Knowledge Graphs: a Survey

arXiv.org Artificial Intelligence

Knowledge Graphs (KGs) are a major asset for companies thanks to their great flexibility in data representation and their numerous applications, e.g., vocabulary sharing, Q/A or recommendation systems. To build a KG it is a common practice to rely on automatic methods for extracting knowledge from various heterogeneous sources. But in a noisy and uncertain world, knowledge may not be reliable and conflicts between data sources may occur. Integrating unreliable data would directly impact the use of the KG, therefore such conflicts must be resolved. This could be done manually by selecting the best data to integrate. This first approach is highly accurate, but costly and time-consuming. That is why recent efforts focus on automatic approaches, which represents a challenging task since it requires handling the uncertainty of extracted knowledge throughout its integration into the KG. We survey state-of-the-art approaches in this direction and present constructions of both open and enterprise KGs and how their quality is maintained. We then describe different knowledge extraction methods, introducing additional uncertainty. We also discuss downstream tasks after knowledge acquisition, including KG completion using embedding models, knowledge alignment, and knowledge fusion in order to address the problem of knowledge uncertainty in KG construction. We conclude with a discussion on the remaining challenges and perspectives when constructing a KG taking into account uncertainty.